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A SIGNIFICANCE TEST FOR MINIMUM RANK IN 
FACTOR ANALYSIS 


PAUL G. HOEL 
University of California at Los Angeles 


This paper presents a new method of determining the minimum 
rank in factor analysis, appropriate to the principal axes solution. 
The new method is compared with a former method which, with some 
adjustment, is more convenient for the centroid approach. Both 
methods are applied to two familiar examples. 


The question of when to stop in the process of extracting factors 
from a correlation matrix seems to be answered in common practice 
by calculating the mean absolute residual correlation and comparing 
it with the standard error of a simple correlation coefficient. Unfortu- 
nately, this test is not used correctly under the assumptions implied 
by its use; and even if it were, it would not ordinarily give a satisfac- 
tory answer. There is a tendency for psychologists using this test to 
underestimate the minimum number of significant factors needed to 
describe their data. This is particularly true if the correlations are 
based on at most 200 observations, because under such circumstances 
the standard error of a correlation coefficient is large, while it takes 
comparatively few factors to reduce a set of positive correlations to 
a set of residuals satisfying the customary standard error criterion. 
As an illustration of the situation, a 10-rowed matrix containing posi- 
tive elements drawn from a table of random numbers was factored by 
the centroid method. The values of the mean absolute residuals turned 
out to be .477, .218, .176, .135, .122, .075, .067, corresponding to the 
extraction of 0, 1, 2, 3, 4, 5, 6, factors respectively. If common prac- 
tice were followed, not over 4 factors would be considered significant 
for a sample of 100, and not over 5 for a sample of 200. 

In a previous paper (1) the author presented a method for hand- 
ling this question, having particularly in mind factor analysis based 
upon the principal axes approach. However, with a slight change, this 
method turns out to be more suitable for the centroid approach. The 
present paper develops another method which is more convenient 
when principal axes are used, and which is interesting for its own sake 
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because of the relationship it brings out between these two approaches 
to the subject. The paper proceeds by developing this second method, 
making the necessary change in the first method, and then applying 
both methods to two well-known examples. 


SECOND METHOD 


Consider a set of n statistical variables z,, z,---, Zn, With zero 
means, which are subject to a significant error of measurement. Each 
variable z; may be thought of as the sum of a true variable x; and an 
error variable y;. It will be assumed that the error variables are un- 
correlated with each other and with the true variables. With these 
assumptions, and with E denoting the expected value, it follows readi- 
ly that 


mi; = E(z,2;)) (a +y;)) (435), 
m, = E(z?) = = E (2?) + E(y*) 


= Mi + mii, 


m” 
= mis — mM", = Mii (1———) = mii (1— Bi) , 
Mii 
my Mi; 
Vmill— Bi)m;(1— 


1 — 


The notation here is somewhat simpler than that customarily em- 
ployed with the centroid method. Here z; is the test score, x; the com- 
mon factor part of it, and y; the specific and error factor parts com- 
bined. Hence 7’;; is the correlation between the common factor parts 
of tests i and j, while 7;; is the usual correlation. In the customary 
centroid notation, 2”;; = 7;;/hih;, where h;? is the communality for 
test 7; consequently h;? = 1— #; expresses the relationship between 
the present notation and the usual centroid notation. 

The test obtained in this paper hinges upon an inequality found 
in studying properties of the common factor correlation determinant 
\r’;;|. By means of (1), |7’:;| may be expressed in terms of the r;; and 
8; . If the denominators are factored out from each row and column of 
it can be expressed as 1/(1 — (1 — --- (1— Bn) multi- 
plied by the determinant 


(t~)). (1) 
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Tor 1— Ten 
F = (2) 
Tra Tre 1— 


Considered as a function of the §;, F may be expanded by means 


of Taylor’s formula about some convenient point P(f,,f2, ---, Bn). 
The derivatives of F are found easily since F is linear in any one vari- 


oF 
able. Thus ae 


deleting row and column i, 


= F);, , where F);, denotes the minor of F obtained by 

= F)i; (7), where F);;, is the 
minor obtained by deleting rows and columns i and j. Substituting 
these and corresponding formulas in Taylor’s expansion for 7 
variables 


F=F— 


“Ma 


Pyic (Bi — BY) +E (Bi — BY 
(3) 
+ (—1)"(8, — B:) (Bn — Bn) 


where F is the value of F at P, etc. 

The equation F = 0 represents a surface in a space of m dimen- 
sions. In order to study this surface, consider its intersections with a 
line L passing through P and having direction numbers @,, a2, --+, Qn, 
all of which are positive. The equation of L may be written as 


bi — Bi = ta; (¢ = 1, 2,---,”). (4) 


To find the points of intersection of L with the surface, it is necessary 
to substitute (4) in (3) and solve the resulting equation (5) for t. 
Thus 


0=F—ty Py aa;+---+ (5) 


i<j 


Now the nature of the surface F = 0 will be determined by the nature 


of the roots of (5). 
First, it will be shown that the roots are real. Consider the 


j 
| 
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symmetric determinant 


| 
Va,a, Vaya, | 
1—Af, 
| Va,0, Vann 


If ¢ is subtracted from each diagonal element of G, and the resulting 
determinant, H, is expanded by means of (3) about the origin, where 
now 
i<j 
But upon factoring out the denominators in G, G)i,, etc., it becomes 
evident that 


F Py a; 
336 
+++ Ay +++ Ay 
Hence, 

— SF ,;, a; 2S a; +--- 
[ > iti A; + 
+ (—#) ay]. 


This shows that the roots of (5) are identical with those of H = 0, 
which is the characteristic equation of G. Since it is well known that 
the roots of the characteristic equation of a real symmetric matrix are 
all real, the roots of (5) must be real for every point P and every set 
of positive numbers a; . 

Secondly, it is known that the n roots of (5) are continuous 
functions of the 8; and the a; which enter in the coefficients of the 
equation and which serve to determine the line L. 

Because of these two properties, it follows that every line with 
positive direction numbers intersects the surface in exactly n real 
points, if a root of multiplicity q is considered as corresponding to q 
coincident points on the surface, and that the surface consists of n 
continuous sheets. If the a; are held fixed and the ; allowed to vary, 
the resulting system of parallel lines, by their intersections with the 
surface, will serve to describe these n sheets. These n sheets may not 
be distinct at all points, since (5) may not possess distinct roots for all 
values of the 8; . However, if each sheet is numbered according to the 


i 
vag 2. . 


PAUL G. HOEL 249 


order of magnitude of the corresponding root of (5) where the roots 
are distinct, there is no ambiguity in speaking of the sheets as though 
they were distinct. Furthermore, the continuity of the roots insures 
that no line with positive direction numbers can intersect the same 
numbered sheet more than once. Otherwise there would exist a line 
parallel to the line in question intersecting the surface in, usually, two 
coincident points, in the neighborhood of which the roots of (5) 
would not all vary continuously. This excludes the possibility of any 
sheet being closed, say, like an ellipsoid. As a matter of fact, the con- 
tinuity of the roots restricts the nature of the surface to the extent 
that no line with positive direction numbers can be tangent to it. 
These properties of the surface suggest the Theorem: If F is of rank 
n —q for some point P(f, ,--- Bn) in the all-positive region, the q-th 
largest characteristic root of the correlation matrix can not exceed the 
largest of the numbers ; . 

Since F is of rank n — q at P, all principal minors of more than 
nm — q rows vanish at this point. Hence all F’s with less than q sub- 
scripts vanish and ¢ = 0 will be a root of multiplicity qg of (5). Con- 
sequently, at this point gq of the n sheets of the surface coincide. It is 
sufficient to assume that this g-tuple root is the smallest root of the 
equation, otherwise the theorem could be proved true for a larger 
characteristic root. If the theorem were false, it would be possible to 
find a line with positive direction numbers which would pass through 
the point P at which q of the sheets coincide and the point on the q-th 
sheet where the line 6, = 6, = --- = §, intersects it to produce the q-th 
characteristic root. However, previous considerations exclude the 
possibility of any such line intersecting the same sheet twice. 

If k, represents the g-th characteristic root and £» the largest of 


the numbers ,, -:: , Bn, then the theorem states that k, < Bn. Since 
h? = 1—8;, this is equivalent to 


kg+ Nn? <1, (6) 


where h,,? is the smallest of the communalities. This inequality is 
true for population parameters only. In order to apply it to a sample, 
it is necessary to consider its sampling error. 

In general if z = x + y, then o,? = o,? + 2reze, + o,?. If r is 
unknown but positive, the best that can be found as an upper bound 
for o- iso, < o,-+ o,. If z < 1, then for z approximately normally 
distributed, the odds are better than 19 to 1 that a sample value of z 
will not exceed 1 + 1.65(o, + oy). 

Geometrical considerations show that sample values of h,,? and 
k, are positively correlated, so that the above statements apply here. 
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Consequently, with temporary normality assumptions and with capital 
letters representing sample values, the odds are at least 19 to 1 that 


K,+Hn? <1+1.65(0, 


Now H,,? and K, are both variances. If the basic variables, that 
is, the ability scores which produce the test scores, are assumed to be 
normally distributed, the standard error of a variance may be em- 
ployed here to produce odds of at least 19 to 1 that 


Ky+ Hn? < 14 1.65V2/N (ka + lin’). (7) 


For large samples the population values k, and h»? may be replaced by 
their sample values. With this substitution, the inequality finally takes 
the form 


K,+ 8.2 < (8) 


~ 1—1.652/N 

This inequality underestimates the actual relationship at several 
stages; consequently, sample values close to this significance level 
should be judged significant, while values beyond this level should be 
considered highly significant. 


Comparison of Methods 
The former method was based on Hotelling’s paper (2) in which 
no specific factors were introduced, and hence must be altered slightly 
to fit the more general approach to the subject. In the notation of the 
author’s first paper, 7’i; =7i; (1+ 4:)(1+4;) , where the 4; were 
chosen for their mathematical convenience rather than their psycho- 


logical meaning. In the present notation 7’;; = i . Hence, when 
iN; 


1 
specific factors are considered, 4; = oo 1 , as in contrast to the 


1 
statement made in that paper that 4; = —-—1, which is valid only 


when specific factors are ignored. Consequently, by merely replacing 
reliability coefficients (7;) by communalities (h;?), the material of 
that paper becomes valid for the usual factor analysis setup. With 
this alteration, the results of that paper can be condensed into 


W = 1), (9) 


Z 
—> 
=a 


Q3<...<On  a3<...<dn 


| 
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where the sums in the denominator continue until the final term 
(1 — h,?) --» (1 —h,?) is reached, |7;;| is the correlation determinant 
with diagonal elements unity, N is the size sample, 7 is the number of 
tests, » — q is the postulated minimum essential number of factors, 
and W > 1.65 produces odds of at least 19 to 1 against the truth of 
the postulate. 

The results of the present paper can be condensed into 


1— 1.65\/2/N 
where K, is the q-th characteristic root of the correlation matrix with 
diagonal elements unity, H,,? is the smallest of the communalities, 
nm — q is the postulated minimum essential number of factors, and the 
odds are at least 19 to 1 against the truth of the postulate if the 
inequality is not satisfied. 

The assumptions upon which the proofs of these two methods de- 
pend are quite similar. Each requires large samples and approximate 
normal distributions in the basic variables. The restrictions of the 
present method are perhaps less severe, but it requires larger samples 
to give one the same confidence in its conclusions. In each case it is 
necessary to replace population parameters by sample estimates. 


. ILLUSTRATIVE EXAMPLES 
A. The following example was judged by Holzinger (3) to re- 
quire but one significant factor. 


1 .059 195 297 


1 149 284 .249 
1 466 .669 
1 
| 1 


Calculations by standard methods give |7;;| = .289; H;? = .248, .245, 
.675, .746, .797; K; = 2.45, .95, .84, .58, .28. Here N = 140 and 
n= 5d. \ 
Method 1. Assume one factor is sufficient. Then the rank is 1 
and q = 4. Here the denominator of (9) is 
> (1—H* ) )H? + (1 — H,,”) --- (1— = .101, 


Q3<...<a5 


and W = V/140/2(5) (2.36 —1) = 5.1. 


This value of W is highly significant, so that there is no doubt but 
that the hypothesis is false, and hence that at least two factors are 


—y 
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necessary. If it is assumed that two factors are sufficient, a non-sig- 
nificant value of W is obtained. 
Method 2. Assume one factor is sufficient. Then K, = .95, 
H,? = .24, and (8) becomes 
1 


1 — 1.65\/2/140 


This is barely satisfied. Because of the many inequalities used to ob- 
tain this final inequality, such a borderline case should be considered 
as significant. Although the conclusion that at least two factors are 
necessary is obtained by both methods, the first method arrives at the 
result decisively. 

B. The following example was judged by both Holzinger and 
Thurstone to require but one significant factor. 


| 1 58 58 51 48 43 48 46 
| 1 47 50 53 .40 .44 .45 
| 1 48 .41 .45 .38 
| 1 .34 .50 .85 .40 

1 .33 .41 .39 


1 41 .33 
i 
| 


Calculations give |7;;| = .06; H;? = .656, .639, .571, .587, .559, .522, 
.434, .377; K; = 4.08, .77,---. Here N = 127 and n = 8. 

Method 1. Assume one factor is sufficient. Then gq = 7 and the 
denominator of (9) is 


The corresponding inequality becomes Z/¢ > .83. Since a value of 
Z/ < 1 cannot yield a significant result with this test, the hypothesis 
is not discredited here. 

Method 2. Assume one factor is sufficient. Then K; = .77, H,,? = 
.38, and (8) becomes 


1.26 . 


1 
17+ .38< =1. 
1.65\/2/127 


This is not as close as in the previous example, yet it is sufficiently 
close to make one doubt the truth of the hypothesis. Hence to be on 
the safe side, two factors should be considered. 


| 
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CONCLUSIONS 


The former method is convenient when the centroid technique is 
used, since the communalities are then available and it is only neces- 
sary to calculate the value of the correlation determinant. The present 
method is convenient when the principal axes technique is used, since 
the characteristic roots are then available and it is only necessary to 
find a suitable estimate of the smallest communality. The usual pro- 
cedure of selecting the largest element in a given row as a first esti- 
mate of the communality for that row is not permissible here since 
such an estimate will invariably overestimate the true value. However, 
one of the more refined methods available will usually give a satisfac- 
tory estimate. A comparison of the two examples above indicates that 
the superiority of one test over the other varies with the problem. 
There are reasons, however, for believing that the first method is apt 
to be the more sensitive; and certainly it is easier to calculate the 
value of |7;;| than it is to calculate a number of characteristic roots 
if the technique employed does not yield them. 
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NEURON CIRCUITS: THE SELF-EXCITING NEURON 


H. D. LANDAHL AND A. S. HOUSEHOLDER 
The University of Chicago 


Conditions are determined under which a single self-exciting 
neuron can reach a state of permanent excitation, in the case that 
the neuron develops both an excitatory and an inhibitory substance as 
well as in the case that it develops only a single, excitatory substance. 
In case both substances are developed it is possible to have periodic 
solutions, a possibility which does not arise when the circuit consists 
of neurons, or of a neuron, developing only an excitatory substance. 


1. Formulation of the problem. It is known that the nervous 
system contains numerous instances of pairs of mutually exciting 
neurons. Rashevsky (1, Ch. XXV) has shown theoretically that such 
a pair may be brought to a state of permanent excitation and so may 
furnish the necessary hysteresial element in the formation of a con- 
ditioned reflex. However, one finds instances also of single neurons 
possessing dendrites which reach forward and appear to make synap- 
tic contacts with collaterals to their own axons, thus forming mono- 
neuronic circuits.* It is the purpose of the present paper to discuss 
theoretically a circuit of this simple type (Fig. 1). 


h 


h 


> 


FIGURE 1 


In notation and method we shall follow Householder’s discussion 
(2) of the circuit composed of two neurons, a discussion which elab- 
orates, somewhat, upon that of Rashevsky mentioned above. The ter- 
minology and theoretical assumptions are Rashevsky’s. 

Whereas Rashevsky and Householder considered only the case 
that each neuron develops a single substance, e«, we wish to consider 
also the development of ¢ and j, an excitatory and an inhibitory sub- 
stance. For the self-exciting neuron, the one-substance problem leads 

*In the figures we introduce a dot to represent the nucleus so that all im- 
pulses on one side of the nucleus travel toward it (dendrites), and on the other 


side away from it (axons or their collaterals). In complex circuits this convention 
may be especially useful. 
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to a single differential equation of the first order which is solvable by 
a quadrature. 

We assume, then, with Rashevsky, that the neuron, upon applica- 
tion of a stimulus of intensity S at any dendrite, becomes excited in 
degree E(S), where E is some function which is monotonically in- 
creasing and which is zero unless S exceeds a certain threshold value 
h. Thereupon the neuron develops the two substances « and j at the 
extremity of the axon or of a collateral. The development of each is 
proportional to the degree of excitation E(S), but each substance is 
dissipated at a rate proportional to its own concentration. The differ- 
ence of these two substances, « — 7, acts as a stimulus of intensity 
S = e—j upon any neuron sending out a dendrite from the synapse 
at which the development takes place. 

Now, suppose an outside stimulus acting for a time results in 
the development of <« and 7 at the synapse of the circuit until these 
have the values, say, ¢&) and jo. If ¢) — J» exceeds the threshold of the 
dendrite of the circuit, then re-excitation will be taking place. If the 
outside stimulus is now removed we shall have the equations 


de 
AE (e — 7) 
(1) 
dj 
a —bj, 


governing the ensuing development, where EH (e—j) denotes some 
function of « — 7. If the outside stimulus is not removed, however, we 
shall have to add to the first term on the right of each equation an 
additional positive quantity describing the continued action of this 
outside stimulus. If this added term in each equation is independent 
of « and j, i.e., if the excitations resulting from stimulations of the 
two dendrites combine additively, then we may write, in place of (1), 
the equations 


A[E, + E(e—j)]—ae, 
| (1) 


with EZ, a function only of this outside stimulus intensity. 

We shall consider first the case in which the function E may be 
taken as linear with a sufficient degree of approximation, and second 
we shall consider the more general non-linear case. 
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2. The linear excitation function. We shall assume E, to be con- 
stant. Then E is proportional to e — 7 —h, and if we absorb the con- 
stant of proportionality into A and B, we may write 


Then the equations (1’) have the form 
de , 
(3) 
dj 
with 
a, = A(E, —h) ? a. = B(E,—h), 
6, =A—a, 6.=B-+b, (4) 


Evidently y,, 6. and y2 are positive; a, and a, have the same sign and 
are positive only when E,, and hence the outside stimulus, is suffi- 
ciently large. The sign of £8, depends upon the properties of the neu- 
ron. 


If we set 
ape — 
: Bs Bo 
(5) 
or, employing the values (4), 
A(E, —h) 
af1— 
(6) 


B(E, —h) 
A 
(——>)] 


jo= 


then «, and 7, are the values of e and 7 for which the right members 
of (3) vanish, so that the substitution 


= é&—£0; (7) 


gives 
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dx 
ap — ’ 


(8) 
d 
= you — Bry . 
The characteristic equation of (8) is 


| — Biba = 0. (9) 
V2 | 


We are interested in determining, if possible, values of the parameters 
which yield a stable equilibrium and those which yield periodic solu- 
tions. We shall take up first the case of the periodic solutions. 

A system of linear differential equations possesses a pair of 
periodic solutions provided the characteristic equation has a pair of 
pure imaginary roots. Hence equations (8), and hence (3), have 
periodic solutions provided 


=0, — >O0. (10) 


In terms of the original set of parameters, these conditions are equiva- 
lent to 
A—B=a-b, (11) 
a b 

The first condition thus requires that A shall exceed B, i.e., that 
the rate of production of « shall exceed the rate of production of 7, by 
an amount which is dependent upon the rates of dissipation of « and 
j, aS well as upon the amounts of « and 7 momentarily present. The 
second condition is certainly satisfied if the right member of the in- 
equality is negative. A simple neuron of this type, one not forming 
a circuit, Rashevsky has calied an inhibitory neuron of intermediate 
type, because if it is acted upon by a constant stimulus it develops 
first an excess of e, but later an excess of 7(1, pp. 224, 260-261). 

These conditions are not sufficient, however, for actus] perio- 
dicity. We must have, in addition, the conditions 


&>O0, 


since (€ , jo) is the point about which the fluctuation occurs. If we 
compare (6) with the second condition (11), we obtain the added 
condition 


E,>h, (12) 


| 
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and these conditions, (11) and (12), are necessary and sufficient for 
periodicity. 

This last condition means that the circuit will not exhibit periodic 
fluctuations of excitation if left to itself, but only if a sufficiently large 
constant external stimulus is applied. Such a situation could arise if 
the circuit is connected with another circuit in dynamic equilibrium. 

THEOREM 1. A self-exciting neuron, developing both « and j ac- 
cording to the linear differential equations of the form (3), can exhibit 
periodic fluctuations of excitation only when it is constantly excited 
by a sufficiently great external stimulus, never after it is left to itself. 
If it is stimulated from without by a constant stimulus great enough 
for (12) to be satisfied, then it will exhibit such periodicity provided 
its parameters satisfy the conditions (11), where A and B are para- 
meters governing the rates of development of e and j, respectively, and 
aand b are parameters governing the rates of their dissipation. Con- 
dition (11) concerns only the properties of the neuron; condition 
(12) involves the magnitude of the external stimulus. 


We turn next to the question as to the existence and stability of 
equilibria. There is always one and only one equilibrium point in the 
linear case, viz., the point (e, jo) defined by equations (5) or (6). 
But if this is to be a meaningful equilibrium, both coordinates, «, and 
jo, must be positive. It is evident that both will be positive or else 
neither will be positive. 

Any equilibrium is stable only when the real parts of the roots 
of the characteristic equation are both negative (3). This requires 
first that the two roots shall be either complex, or else real and of like 
sign, and therefore that the second of conditions (11) shall be fulfilled. 
Hence the denominator in (6) must be positive and therefore, if the 
fractions are to be positive, the numerators must be positive and (12) 
must be satisfied. In place of the first of conditions (11), however, we 
have, as the condition that the real parts of the root shall be negative, 


or A—B<a+bdb. (13) 


Hence we conclude: 

THEOREM 2. A self-exciting neuron, developing both e and j ac- 
cording to the linear differential equations of the form (8), can reach 
a state of steady excitation (&, jo) as given by (5) or (6) only when 
it is constantly excited by a sufficiently great external stimulus, never 
when left to itself. If it is stimulated from without by a constant 
stimulus great enough for (12) to be satisfied, then it will possess 
such a state of steady excitation provided its parameters satisfy (13) 
and the second condition (11). In these inequalities, A and B are 


J 
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parameters governing the rates of development of « and j, respectively, 
while a and b govern the rates of their dissipation. As in Theorem 1, 
condition (12) places a lower limit upon the intensity of the outside 
stimulus, while the other two conditions concern only the properties 
of the neuron itself. 


3. The non-linear excitation function. We now consider the case 
in which the excitation function E(e — 7) is non-linear, and we as- 
sume that E is a properly monotonically increasing function of its 
argument « — j, and that it possesses a properly monotonically de- 
creasing derivative. Naturally we still assume that £ is positive only 
when « — j exceeds a positive threshold value h. We consider the de- 
velopment only when the external stimulus is withdrawn, so that 
E, = 0. Hence we return to equations (1). 

Equilibria occur at the solution (¢€, jo) of 


=—- = = - (14) 


For these to be meaningful, each member of (14) must be positive. 
E is positive if ¢,—j, >h. If we set 


AS, 
A B ’ 
(15) 
BS, 
RB 
then 
So = & (16) 
and equations (14) are equivalent to (15) and 
— So 
E(S.) = (17) 
a 


Since the graph of E is increasing and concave downward, it is 
evident that (17) has not more than two real roots. In order that it 


may have two it is necessary that 


A B A aii 
<1, (18) 
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although these conditions are not sufficient.* We assume, therefore, 
that (17) has two distinct real roots, and hence that there are two 
equilibria besides the obvious equilibrium state of inactivity. 

It is convenient to transform equations (1) into a single equation 
of the second order, by means of the substitution 


S=e—j. (19) 


If we differentiate equation (19) twice and equations (1) once we 
obtain, in all, seven equations involving S, «, 7, and their first and 
second derivatives. From these seven equations the six quantities, e 
and j and their derivatives, can be eliminated algebraically, and when 
a+ b there results 


as dS 
la+b— (A—B)E(S)] + as 


— (bA —aB)E(S) =0. (20) 


It is understood that E’(S) signifies the derivative of E with respect 
to S. 


Now let 
and S.=¢>p (21) 
be the two roots of (17) and set 
S=2-+ (22) 
and 
y q. (23) 


The x and y used here are, of course, different from those of section 2. 
We shall show by the first transformation that p is always a point of 
unstable equilibrium. By the second transformation we shall show 
that under suitable circumstances the point q may be a point of either 
stable or else of periodic equilibrium. 

To prove the first assertion, we set 


(bA —aB)E(x-+ p) —abp = (bA —aB)¢4(2) (24) 
Then 
E'(x-+ p) = ¢'(2), (25) 
and 
¢(0) =0. (26) 


* A condition analogous to the second of (18) should have been added to those 
given in 2, p. 274, since failing such a condition only a single, unstable, equilibrium 
would exist. Physically we may expect even the stronger condition E'(«) =—0 
because of the finite refractory period of the neuron (1, Ch. XXII). 
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Since (17) has no root smaller than p and one root greater, it follows 
that at p the derivative of the left member of (17) is greater than 
that of the right member. Hence, if we refer to (25) we see that 


>. (27) 
Therefore if we write 

=¢,24—--- , (28) 
then (27) is equivalent to 

(bA —aB)c,2> ab. (29) 


Equation (20) is equivalent to 
x” + [a+ b— (A—B)¢'(x)] x’ + abs 
— (bA —aB)d(x) =0. (30) 
If we retain only the linear terms in (30) as given by (28), we 
have, approximating 2, the function &, where 
[a+ b— (A—B)c,*]é' + [ab — (bA —aB)c7JE=0. (81) 


The coefficients of and its derivatives in (31) are coefficients of the 
characteristic equation. But the inequality (29) shows that the con- 
stant term in the characteristic equation is negative, and therefore 
the characteristic roots have opposite signs. Hence the equilibrium is 
unstable. 

Turning now to transformation (24), we set 


(bA —aB)E(y + q) —abq = (bA—aB)y(y) . (32) 
Then 
E'(y+q) =y'(y), (33) 
and 
y(0) =0. (34) 
Since q is the largest root of (17), if we write 
wly) = (35) 


then we have 
(bA —aB)d,? < ab. (36) 


In place of (30) we have 
y" + [a+ b— (A—B)y'(y)]y' + aby 
— (bA —aB)y(y) =0, (37) 
and in place of (31), the equation for the approximation y, to y, 
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n” + [a+ b— (A—B)d,7]1 
+ [ab — (bA —aB)d,7]n = 0. (38) 


The constant term of the characteristic equation here is positive, 
so that the characteristic roots are either complex, or else they are 
real and of like sign. For the equilibrium to be stable it is necessary 
and sufficient that we have 


a+b> (39) 


If the characteristic roots are complex, the approach to equilibrium 
(€, jo), given by (15) with q replacing S,, is along a spiral path. 
If the roots are real, the approach is along one or the other of two 
determinate tangents. This may be compared to the two-neuron cir- 
cuit, each neuron developing a single substance, «, where the charac- 
teristic roots are always real and the approach to equilibrium always 
along a determinate tangent (2, p. 281). 

For periodicity to arise, it is necessary, but not sufficient, that the 
inequality sign in (39) be replaced by equality: 


a+b= (A—B)d,’. (40) 


This condition would be sufficient if the curvature of E were so small 
in the neighborhood of S = q that the approximating linear equation 
(38) could be taken as exact. In general, however, even when (40) 
is satisfied so that first-order terms have no damping or dispersing 
effect (leading to the presence of theexponential factor in the solution 
multiplying the trigonometric terms) the terms of higher order may 
have such an effect, and for true periodicity an infinite number of 
conditions must be imposed. These are analogous to (40) but involve 
coefficients of terms of higher order in the expansion (35). However, 
this effect, in general, will be small. 

In summarizing this case, therefore, we may state: 

THEOREM 3. Suppose the excitation function E(S) of the self- 
exciting neuron, which develops e and j according to the differential 
equations (1), is properly monotonically increasing, has a properly 
monotonically decreasing derivative, and is such that equation (17) 
has two distinct real roots. Then equations (17) and (15) define two 
equilibria of the circuit when it is not acted upon by external stimula- 
tion, the equilibrium nearest the origin being always unstable. The 
other equilibrium will be stable, provided the parameters of the neu- 
ron satisfy condition (39). In case the more restrictive condition (40) 
is satisfied, the circuit may be periodic, although terms of higher 
order may exert damping or dispersing effects (which are, in general, 
small). 
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A detailed analysis of the geometric form of the (e,7), path of 
the state of the circuit can be made by the same method as was em- 
ployed by Householder in discussing the two-neuron circuit (2). 

4. A single-substance circuit. In deriving (20) it was necessary 
to assume that a ~ b. If a = b we obtain only a first-order equation 
of the form ; 

= A[E, + E(e)] 
This is precisely the equation of the circuit formed by a neuron devel- 
oping only ¢, and no j7. We consider this type of circuit in this section. 

After making a change in time-units, if necessary, we obtain an 
equation of the same form as that given above, but in which a = 
1 (2, p. 274). By making a change in the unit of the excitation inten- 
sity, also, we may make A = 1. We therefore assume both these 
changes to have been made, and we write 
dt 

We consider only a constant E,. Then equilibrium occurs at « 
= € defined by 


E,+E(e) —e=0. (42) 


We assume again that the function E is properly monotonically in- 
creasing, that its derivative is properly monotonically decreasing, and 
that E(h) = 0. Then there are two possibilities, according to whether 
E,>hor E, < h. 

If E, > h, then (42) has a single root provided only 


<1. (43) 


Otherwise it has none. But if E, < h, then there are two real roots or 
else no real root when (43) is satisfied. 

If E, > h and there is a root of (42), then for ¢ < e the right 
member of (41) is positive, while for e > e, the right member is nega- 
tive. Thus the equilibrium is necessarily stable. However if (48) is 
not satisfied, then the right member of (41) is positive for all values 
of e, and the amount of « increases to infinity so long as E, retains its 
constant value. 

If E, < hand (42) has two distinct roots, say «& = p and & = q 


> p, then 
for h<e<p, e <0; 
for g<e<q, e >; 


for e>q, eé <0. 
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Hence « = q is a point of stable equilibrium, « = p is unstable. Again, 
if (43) were not satisfied, gq would not exist, and if « once exceeded p 
the production of ¢ would continue to infinity. 

An increase in the value of E, results in an increase in the value 
of q and a decrease in or the disappearance of p if p exists initially. If 
the circuit is initially at rest, it can be started only by applying an 
external stimulus sufficient to make E, > h and it must be continued 
until e has time to exceed the value of » corresponding to E, = 0. 

The time required for initiating this activity of the neuron is 
easily obtained from the solution of the differential equations. This 
solution can be written in the form 


de 
(44) 
where «¢, is the value of « at the time f, . 

In partial summary we state the concluding theorem. 

THEOREM 4. Leta circuit be formed by a neuron developing only 
an excitatory substance « according to the differential equation (41). 
Let the excitation function E(e) be properly monotonically increasing 
with a properly monotonically decreasing derivative, and let it be such 
that equation (42) with E, = 0 has two distinct real roots. Then for 
any value of E,, the circuit has always a state of stable excitation. If 
E, >h, then this is the only equilibrium. If E, < h there is also an 
unstable equilibrium, the two equilibria being given by the two roots 
pandq >pof (42). In this case the initially resting circuit can be 
brought to the state of activity by short application of an external 
stimulus sufficient to make E, > h. The time necessary for this appli- 
cation is given by the integral (44), where the lower limit «, is zero 
and the upper limit e is the value of p corresponding to the permanent 
value of Ey. 

5. Some other circuits. Circuits of apparently more complex 
form may, in special cases, be reducible to those previously discussed, 
and we shall conclude by noting a few examples. 

The discussions of the two-neuron circuits which develop only the 
excitatory substance have dealt with the circuits only after the exter- 
nal stimulus has been withdrawn. When the external stimulus is act- 
ing, the equations for such a circuit have the form, analogous to equa- 
tions (4) of (2), 
= E, + 

(45) 


= 
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where the first neuron is stimulated by e, and develops ¢,, and where 
E, and E, are the excitations produced by the external stimulus. If 
E, and E, are constant, then a simple linear substitution 


§&=a—E, (46) 


replaces equations (45) by the equations 


dé, 


(47) 
dé. _ 
where 
H,(é,) = E,(&,+ E,) ’ H,(&2) = E,(& + E;) . (48) 
| | 
So 
S2 S; So 
a. b. 


FIGURE 2 


The functions H have the same monotonicity properties as the func- 
tions E, so that equations (47) are of the same form as the equations 
(4) of (2). The only difference is that the unstable equilibrium may 
no longer exist. The stable equilibrium remains. 

Circuits of this type are represented by Fig. 2b, and, less obvious- 
ly, by Fig. 2a. Here each fiber has a dendrite which reaches forward 
and synapses with a collateral to the axon of the other fiber. 

Such a circuit as this, when it develops both substances, e¢ and j, 


266 
| 


H. D. LANDAHL AND A. S. HOUSEHOLDER 267 


exhibits the same properties as the simple circuit formed by a single 
neuron provided the two fibers are identical, having the same excita- 
tion functions EZ, and the same parameters A, B, a and b, and provid- 
ed, in addition, the external stimuli are always equal. For the two 
neurons will then be interchangeable, «, and «. will remain equal, 
j, and j2 will remain equal, and we may therefore set ¢, = &. = ¢, j1 = 

jo = j, and write equation (1’) as governing the activity of this cir- 
cuit. But if the neurons are not alike, or if the external stimuli are 
not the same, then the values of the («’s) and those of the (j’s) will 
not then be equal, and four equations are needed. 

Even more complicated circuits can be reduced to this case when 
the component neurons are alike and the external stimuli equal. An- 
other possible complication is illustrated in Fig. 2c, where a dendrite 
of each neuron branches and synapses with the axon collaterals of 
both neurons of the circuit. If the neurons are identical and the ex- 
ternal stimuli equal, then the («’s) are equal and the (j’s) are equal 
as before. But since each neuron is stimulated by itself and also by 
the other neuron, the contribution of the circuit to the production of 
e by either neuron is not AE, but 2AE. Hence the equations are 


= AE, + 2AE(e—j) —ae, 

(49) 
2BE (e—j) — bj. 


But these factors 2 do not alter the form of the equations, and if we 
replace, say, 2E by H, equations (49) are obviously of the form (1’). 
In fact, any number of identical neurons could be thus inter-connected 
by means of branching dendrites, and if the dendrites have the same 
number of branches the form of the equations is still the same. 
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The article discusses the general principles involved in factoriz- 
ing correlations between persons, and compares the special tech- 
niques put forward by Burt and Stephenson respectively. The chief 
points of agreement and disagreement are summarized. In statis- 
tical procedure the two writers prove to be largely in accord; but 
they differ over the psychological and the methodological implica- 
tions, the chief source of divergence being the so-called “reciprocity 
principle.” 


INTRODUCTION 


During the past five years the authors of this joint paper have 
worked together in the same laboratory using the device of correlat- 
ing persons, instead of correlating tests, as a basis for factor-analy- 
sis. Starting with virtually the same procedure, devised in the first 
instance for practical problems of educational and vocational guid- 
ance, they have found themselves in broad agreement in regard to 
the main principles and general conclusions. On the other hand, they 
have found themselves diverging more and more in regard to the theo- 
rectical implications of the method. Since in their own more recent 
publications it is naturally these subsequent divergences that have 
formed the centre of discussion, the numerous points of initial agree- 
ment have been overlooked by those who have taken an interest in 
similar problems. 

The issues out of which the differences chiefly arise are seen most 
plainly in Burt’s attempt to review the whole subject (8). His con- 
clusions implied certain tacit assumptions which seemed so untenable 
to Stephenson that, on reading the draft of it in manuscript, he con- 
sidered that less confusion would be caused if its publication were 
postponed until “a joint paper could be drawn up, either that we 
could all agree upon, or else giving our own versions of what we each 
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think we have done.’”’* At the time this proved impracticable. The 
following summary, however, is a considered attempt to produce such 
an agreed statement. Our chief aim will be to formulate, as briefly 
and as concisely: as possible, the main points both of agreement and 
of difference. 

To begin with, we may perhaps indicate what we believe to be 
more or less original in our respective contributions. These happen 
to be the points to which our critics have mainly taken exception. 


INITIAL PROPOSALS 


(A). Burt has made the following suggestions: 

1. In obtaining trait-assessments for a number of persons, to 
begin by grading the various traits for one and the same person (or 
group of persons) instead of by grading the various persons for one 
and the same trait (e.g., in dealing with imagery, to inquire whether 
A’s visual imagery is stronger than his other imagery, instead of in- 
quiring whether A’s visual imagery is stronger than B’s), and to use 
these assessments for determining the mental “profiles” or “patterns” 
characteristic of different individuals, types, sexes, social classes, and 
the like. Such a method of grading has proved to be of special value 
(a) in practical work (educational, vocational, or clinical — cf. 2, 3, 
4), particularly where the assessments have to be made, not by the 
same observer, but by different observers whose standards cannot 
well be equated; (b) in theoretical work (5, 9), particularly where it 
is the relative strength of all the traits rather than the absolute 
strength of each trait that provides the requisite description of the 
individual, type, or group. 

2. On the basis of such gradings, to calculate covariances (or 
correlations) for persons as well as, or instead of, for traits. 

3. To apply factor analysis to tables of covariances between 
persons as well as to tables of covariances between traits (8 and 
refs.). 

4. More generally, to distinguish between row-factors, obtained 
by postmultiplying the measurement-matrix by its transpose and fac- 
torizing the product, and column-factors, obtained by premultiply- 
ing the measurement matrix by its transpose (e.g., by correlating 
rows and correlating columns, respectively). When the initial ma- 
trix is arranged according to the usual convention, the row-factors 
are factors for tests or traits, and the column-factors are factors for 


* Eventually it was agreed that this particular paper, dealing with the al- 
leged reciprocity between correlating persons and traits, should be published; 
and Stephenson’s reply was added in a postscript to his article in this journal 
(7). 
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persons; but such designations are not universal, since in certain cases 
tests may become personified and appear as examiners, and other 
persons may be used as tests to test them (e.g., when the “reliabil- 
ity” of different interviewers is tested with a given batch of candi- 
dates) ; in other cases the columns may represent gradings obtained 
for one and the same person only, or for combined groups of persons. 

5. To draw a sharp distinction in the results of factor-analysis 
(whether reached by correlating columns or rows, persons or traits) 
between (i) the first or general factor, which in nearly every form 
of factor-analysis (a) has positive saturations or loadings only, (b) 
emerges first (but may not be found at all), and (c) accounts for 
more of the total variance than any other; and (ii) subsequent or 
secondary factors, which, according to the method of analysis, will 
be either (a) “bipolar factors,” i.e., factors having negative as well 
as positive saturations or loadings (as in multiple general factor 
analysis before rotation), or (b) “group factors,” i.e., factors hav- 
ing mainly positive or (approximately) zero factor loadings (as in 
group-factor and bi-factor analysis, and in multiple factor analysis 
after rotation). 

6. Adopting these distinctions, Burt claims to have shown (a) 
that the general factor for persons cannot be obtained by factorizing 
correlations between traits; (b) that the general factor for traits 
cannot be obtained by factorizing correlations between persons; (c) 
that with one and the same matrix, suitably standardized and suit- 
ably analyzed, the secondary bipolar factors are approximately the 
same whether obtained by correlating persons or correlating traits— 
exactly the same under certain definable conditions; (d) that second- 
ary bipolar factors express differences between positive group fac- 
tors (9, 10). 

7. To use the first or “general” factor for persons (which can- 
not be obtained by correlating traits) as a mode of measuring char- 
acteristics (e.g., correspondence with an ideal standard) which can- 
not be measured by other means. The weighted average of trait- 
measurements then provides the ideal standard — a set of “true” 
values by which the reliability of examiners or the preferences of 
judges may be assessed (cf. 11 and refs.). Where, however, the dif- 
ferences in weights are non-significant, Burt would prefer to use the 
method of analyzing variance, developed by Fisher.* 


*In general the two methods lead to similar results, except in regard to tests 
of significance with small samples or non-linear correlations. When (as is com- 
monly the practice in psychology) the trait-measurements are reduced to stand- 
ard measure, the relations between correlational analysis and analysis of vari- 
ance may at their simplest be summed up as follows: (i) the average inter- 
correlation is identical with the intra-class correlation; (ii) the average satura- 
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8. To use the secondary “bipolar” or “group” factors obtained 
by correlating persons (a) as a means of checking the alternative de- 
termination of types obtained by correlating traits, (b) as a method 
of determining types where the correlation of traits is impossible or 
unreliable (e.g., in the determination of imagery types based on selt- 
assessments) (cf. 12 and refs.). 

9. To regard the saturation coefficient for a person as itself a 
correlation with the type standard, and to use its inverse hyperbolic 
tangent (obtained by Fisher’s z-transformation) to measure that per- 
son’s approximation to the type. (The advantage of the transforma- 
tion is that a normal distribution of the type-measurements is no 
longer precluded). 

10. To base reliability coefficients for the same person on a num- 
ber of trials instead of two, and by that means to study intra-person- 
al variability and the “cyclic” or “spiral” character of such variations. 

11. To use the relations of equivalence, obtaining between per- 
son-factors and trait-factors under certain restricted conditions, as 
a ground for selecting one particular system of factors from a large 
or infinite number of such systems that might otherwise be avail- 
able (8). 

For brevity, the adoption of these devices — particularly those 
described in paragraphs (1) to (9) — has occasionally been termed 
“P-technique” to distinguish it from the commoner methods of fac- 
tor-analysis that start by correlating traits (‘‘T-technique’”). 


(B). Stephenson has made the following original suggestions: 

1. To use two-factor theorems, instead of multiple factor theo- 
rems, for analyzing correlations between persons (6). 

2. To distinguish between fractional and non-fractional factors, 
substituting the conception of fractional factors for that of multiple 
factors (13). Non-fractional factors refer to the total communality 
of a variable, and represent abilities and types; fractional factors 
seek to analyze them. 

3. On this basis to develop correlating persons into a new tech- 
nique (“Q-technique”’) leading to factors wholly independent of those 
that are obtained by correlating traits or tests. To this end he pro- 
poses to retain the fundamental distinction between populations and 


tion coefficient is approximately equal to the correlation-ratio, calculated as the 
ratio of “variance between means” to “total variance”; (iii) in factor-analysis 
the basis of comparison is the mean of the squares of the saturation coefficients; 
in the analysis of variance it is virtually the square of the mean of the satura- 
tion coefficients (exactly so, with the summation method and perfect reliability; 
approximately so, with other methods or assumptions). 
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variables, and between the kinds of objectives that can be served by 
persons (a) as variables and (b) as items of a statistical population 
respectively (7). 

4. To formulate a set of inverted theorems for such a tech- 
nique* (6). 

5. To introduce for these purposes new concepts such as “psy- 
chological significance” (distinct from “statistical significance’), 
“dominance,” “fractional and non-fractional factors,” ete. 

6. To apply such methods and conceptions systematically (a) 
to the measurement of abilities by reference to a single standard per- 
son; (b) to evaluations obtained from a single person only; (c) to 
experimental work in the laboratory,; as distinct from educational 
or vocational work in the field, and so to solve problems of general as 
distinct from individual psychology (13, 14). 

7. Toconstruct on these foundations a new branch of typology, 
which may be termed “personalistics.” For this purpose Stephenson 
proposes to distinguish between different uses of the word “type”: 
(a) In one usage, types are merely arbitrary sections of a general 
tendency (e.g., introvert-extravert as extremes of a uni-dimensional 
scale of I - E); (b) in another, types are patterns of such tenden- 
cies (e.g., the common-sense type of person has more w than g): 
both of these pertain to types as “classes of beings having a common 
characteristic.” (c) In the third usage, types are lived, specimen- 
entities — characteristic specimens of a special class of beings. Ste- 
phenson gives an explicit definition of this third usage, namely, any 
persons who satisfy the conditions for a non-fractional factor in Q- 
technique are by definition persons of a type (14). 

8. Throughout, to distinguish rigidly between the methodologi- 
cal bases of r- and Q- techniques. Laws in 7-technique derive from 
large numbers of cases (persons) and from individual differences; 
laws in Q-technique can derive from work on one person only, and are 
independent of individual differences. 


POINTS OF AGREEMENT 
After a good deal of mutual discussion we have eventually 
reached the following points of agreement. We both agree: 


* Beebe-Center and Burt had deduced and used similar formulae for this 
purpose, but these were merely the old formulae for correlating traits applied 
to new problems, — Spearman formulae in the case of Beebe-Center, summation 
formulae in the case of Burt. As noted below, Burt regarded such formulae as 
capable of yielding first approximations only to the results obtained by a more 
nn a aga factor analysis based on covariances and the principle of least 
squares 

+ Burt and Binns had applied the method to problems of touch discrimina- 
tion, but only incidentally, not as a systematic branch of laboratory technique (3). 
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1. That correlating persons is a valid general statistical tech- 
nique. In this we both differ from those who would use only product- 
moments or specially devised techniques (e.g., Rhodes) or who con- 
sider the whole procedure invalidated by incomparability of units 
(except in special and restricted cases, e.g., Thomson). 

2. That for one and the same matrix the general factor for per- 
sons cannot be obtained by correlating traits, and similarly that the 
general factor for traits cannot be obtained by correlating persons. 

3. That, in analyzing correlations between persons, instead of 
finding a single general factor sufficient to explain all the correlations 
(as Beebe-Center has apparently maintained), secondary or “group” 
factors are almost invariably discoverable. Thus, in analyzing correla- 
tions between persons as in analyzing correlations between traits, we 
both now agree that it may be necessary to envisage factors of four 
types: (i) general factors, (ii) group factors, (iii) specific factors, 
(iv) error factors (5, p. 259). 

4. That, in analyzing correlations between persons, besides gen- 
eral factor techniques, what Burt has called the group-factor tech- 
nique and Holzinger the bi-factor technique is therefore permissible.* 

5. That the saturation coefficients obtained by correlating per- 
sons may be used to measure the person’s approximation to a type— 
the type being regarded as a single ideal (or idealized) person. 

6. That in all research involving factor-analysis the choice of 
an appropriate experimental design is quite as important as the choice 
of an appropriate method of mathematical analysis, and that the fac- 
tors discovered must depend quite as much upon the former as upon 
the latter. 


POINTS OF DIFFERENCE 

On the following points we find ourselves in sharp disagreement. 
We therefore put them forward as indicating a program for future 
discussion and research. 

1. We differ, first of all, over the importance of the differences. 
Stephenson regards them as wide and fundamental; Burt thinks them 
small and subordinate, and capable of ultimate reconciliation. 

2. The general character of the differences may be summed up 
by saying that Stephenson insists on a sharp opposition between r- 
technique and Q-technique, whereas Burt would regard them as in- 
volving much the same aims, methods, and theorems, and, in principle, 
though not always in fact, as merely alternative ways of analyzing 
any rectangular table of figures. 


* Stephenson would point out that Spearman workers have long made analy- 
ses substantially similar to that recently described by Holzinger as bi-factor. 


4. 
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3. The differences seem mainly to spring from a fundamental 
difference of aim: this appears to have been overlooked by our earlier 
critics who noted only the superficial similarity in procedure. In sub- 
stituting correlations between persons for correlations between traits, 
Burt remained preoccupied with much the same problems as before, 
and consequently (quite wrongly in Stephenson’s opinion) treated the 
adoption of the alternative approach mainly as a matter of practical 
necessity or convenience. Stephenson’s Q-technique is concerned with 
an entirely new set of problems, and is therefore put forward with 
objectives wholly different from those envisaged by r- or P-technique. 

4. We differ over our general methodological approach. For 
elementary statistical reasons, Stephenson draws a sharp distinction 
between statistical variables and statistical populations, which Burt 
appears to confuse: r-technique applies to a population of persons 
with tests or traits, etc., as variables; Q-technique to a population of 
tests or traits, etc., with persons as variables. By retaining this dis- 
tinction Stephenson seeks to remind the factorist that it is his duty, 
as a psychologist, to propose psychological hypotheses, and to “con- 
trol” these by appropriately conceived variables. He therefore refuses 
to reduce everything to statistical populations both ways in any one 
matrix. 

Burt does not deny the distinction between variables and popula- 
tions (or rather constants descriptive of total populations) ; but he 
regards it is an ad hoc rather than as an absolute distinction: he sees 
no a priori reason why the constants in one mathematical argument 
should not become the variables in another. For him the term vari- 
able simply denotes any one element in a class or “population.” In 
factor analysis we are concerned with three “populations” or classes, 
namely, “persons,” “traits,” and “factors”; and, in representing their 
relations geometrically, we may, according to convenience, take any 
one of the three as the initial axes of reference, thus obtaining “P- 
axes,” “T-axes,” or “F-axes.” Since, as a rule, the factors are by def- 
inition independent, Burt regards it as more convenient (but not es- 
sential) to treat the rectangular F-axes as fixed (though initially un- 
known), and, according to the nature of the problem or the data, to 
represent either traits or persons by vectors in the space defined by 
these factors. 

5. Stephenson holds that his own conception of Q-technique in- 
volves a complete break with the concepts of r-technique, i.e., the en- 
deavour to describe psychological phenomena in terms of mental 
“abilities” and mental “tendencies” as these are understood in r-tech- 
nique. In his view Burt’s statements about the limitations of corre- 
lating persons show that he still retains the standpoint of earlier psy- 
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chometry, i.e., he thinks exclusively in terms of the “abilities” or “ten- 
dencies” of the old 7-technique. His work on P-technique, therefore, 
appeared to Stephenson to have entirely missed the essential possibil- 
ities implicit in the idea of correlating persons. Consequently, though 
suggested in the first instance by issues raised at the International 
Conference on the analysis of examination marks and at a laboratory 
discussion on the applicability of Burt’s technique to such problems 
(June, 1935), Stephenson’s developments of the method must be re- 
garded as the independent outcome of special problems of his own. 
He maintains that, though person-factors had previously been used 
for problems conceived on r-technique lines, the wider implications 
of such factors had been, and still are, almost entirely ignored, except 
in work based explicitly on his own conception of Q-technique. As a 
consequence, all earlier investigations, based on correlating persons 
or extracting person-factors, appear to him to come far short of a 
general psychometric technique avowedly defined as such, for the 
plain reason that it has refused to break (or has never thought of 
breaking) with conceptions that are part and parcel of r-technique. 

6. We thus differ (at least in Stephenson’s view) in regard to 
the traits, tests, or type of marking with which the investigations 
should start. According to Stephenson, in Q-technique they must be 
different from those used in r-technique. The view put forward by 
Burt and accepted by Vernon, namely, that the self-same traits that 
are used as variables in correlating traits can change in the twinkling 
of an eye into chameleon-like items of a statistical population when 
correlating persons, appears to Stephenson a gratuitous assumption. 
Q-technique thus leads to entirely new principles of test construction. 

7. We apparently differ in regard to the units to be used in 
correlating persons. In measuring the traits Burt would prefer (a) 
to work with absolute units where possible, making no assumption as 
to the distribution of persons or traits; failing absolute units, (b) to 
start with measurements first standardized for traits or tests; (c) 
where both are impossible or inappropriate, to select the traits or 
test items so that they conform as closely as possible to a specified 
distribution, e.g., rectilinear or normal. In Stephenson’s view this 
limits the units to those that pertain to, and are defined in terms of, 
individual differences. In his Q-technique individual differences are 
wholly irrelevant; a new unit of quantification has therefore to be 
forged which may be termed “significance.” The preliminary stand- 
ardization for traits relative to a population of persons, he believes, 
can lead only to trivial results; and in Q-technique he would begin by 
standardizing measurements for persons on the assumption of a nor- 
mal distribution: i.e., for Q-technique the units must be so chosen 
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that their mean must be zero and the standard deviation unity for the 
population of traits within the person. 

8. Where absolute units are available, Burt proposes to base the 
analysis on product-moments (or on averaged product-moments, i.e., 
variances and covariances) rather than on correlations. He admits 
that for many psychological measurements this may at present be im- 
practicable; and he agrees that correlation is the better method where 
the essential point is to measure the resemblances between persons 
rather than to analyze variance. Stephenson, on the other hand, con- 
siders that correlation should always be used, since (a) covariance 
foregoes the all-important distinction between variables and statisti- 
cal population and (b) variances in psychology can be made arbi- 
trarily just what we please. 

9. For the same reason Stephenson holds that the “analysis of 
variance” (as developed by Fisher and Snedecor) is inapplicable to 
psychological measurements. Burt, on the other hand, regards the 
applications here proposed [page 271 above, (A), para. 7] as a natur- 
al extension of the early procedure adopted by Cattell (1) and his fol- 
lowers; and he considers that by using the analysis of variance in- 
stead of factor-analysis more accurate tests of significance are made 
available, particularly when the samples are small, the correlations 
non-linear, and only one or two “factors” are at issue. 

10. We differ fundamentally over the type of factor-analysis to 
be employed. Burt’s original memorandum on the subject assumed 
that a multiple-factor technique alone was appropriate. Stephenson, 
on the other hand, in his very first contribution on correlating per- 
sons, put forward a two-factor technique instead, and has worked out 
afresh a set of theorems to this end: he maintains that, in correlating 
persons, the two-factor technique is not only sufficient, but closer to 
psychological needs. Burt has occasionally adopted this simpler tech- 
nique in early work where the correlations between persons appeared 
to present a single-factor hierarchy (e.g., 2, p. 136), but considers it 
invalid as a general rule, since with low probable errors more than 
one common factor is nearly always demonstrably required. 

11. We differ somewhat in the considerations which make us pre- 
fer in certain cases to use the bi-factor, group-factor, or submatrix 
technique (Burt’s method a, 5, p. 306). Burt believes that it is appro- 
priate only where the variables form discontinuous groups (10). Ste- 
phenson, on the other hand, would here distinguish between non-frac- 
tional and fractional factors (which will usually be oblique). To dis- 
cover types, he merely reorganizes a table of correlations into clus- 
ters (method a) each of which is sufficient and necessary to define a 


type (13). 
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12. Our chief differences, however, turn on the relations obtain- 
ing between test-factors and person-factors. Burt believes that, 
where conditions are comparable, the factors obtained by 7-technique 
and Q-technique are virtually the same, except, of course, the first or 
general factors. In the simplest case (covariating rows or columns of 
the same doubly-centered matrix) he maintains that the factor-load- 
ings for persons are mathematically identical with the factor-meas- 
urements for traits, and the factor-measurements for persons with 
the factor-loadings for traits; under less rigid conditions he regards 
them as differing only in respect of minor inequalities in weighting 
(8). Stephenson holds that Burt’s algebraic proof is vitiated by un- 
tenable assumptions, and, further, that, even if it were true, the de- 
ductions drawn would render the whole procedure useless, since it 
would follow that it was waste of time to correlate persons as well. 

13. As a consequence of this so-called “reciprocity principle” 
(15), Burt holds that, in theory, secondary or group-factors obtained 
in correlating persons should be of the same nature and number as in 
correlating traits: in practice, since factors in the first instance mere- 
ly define principles of classification, not concrete mental entities, we 
are free to take as few or as many of this number as we please. Ste- 
phenson regards the factors obtained in r-technique as defining the 
fundamental abilities or tendencies of men: consequently, in correlat- 
ing tests the factors will be narrow and rare. On the other hand, in 
Q-technique we seek to map out the field into groups of persons who 
resemble one another with respect to whole aspects of their personal- 
ity: consequently, in correlating persons the factors will be numer- 
ous and broad. After all, in any table of measurements the number 
of tests is comparatively small, while the number of persons may run 
into hundreds or thousands. To this Burt would reply that, although 
the order of the correlation-matrix for persons may as a result be 
much greater than the order of the correlation-matrix for tests, the 
maximum rank of either matrix is limited by the smaller number of 
the two. Hence the maximum number of independent factors (and, 
he would maintain, the actual number of significant factors) must re- 
main the same in either case. Stephenson holds that the number of 
factors is a question of fact, not of mathematical parsimony or prac- 
tical convenience. 

14. Stephenson would add that Burt’s arguments confuse a re- 
ciprocity that may conceivably apply to one and the same set of data 
with Stephenson’s postulation of two distinct sets of data that are 
quite unrelated and serve different psychological objectives: 1r-tech- 
nique pertains to individual differences, Q-technique pertains to gen- 
eral psychology, and is in no way concerned with individual differ- 
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ences (cf. 7 and 13, p. 9). Burt replies that on Stephenson’s own 
showing each of these two sets can still be analyzed both ways,* and, 
in point of fact, all Stephenson’s own analyses by Q-technique seem 
so far to have led to almost exactly the same factors as had previous- 
ly been obtained by r-technique (linguistic and non-linguistic types, 
extraverted and introverted types, objective and subjective types, 
etc.). 

15. We differ over the distribution of trait-measurements for 
one and the same person. Stephenson postulates a distribution fitting 
the normal curve of error (13, pp. 191, 193). Burt would agree that 
traits and test-material can be selected so as to fit, more or less rea- 
sonably, a normal distribution ; but, as with the distribution of popu- 
lations of persons for a single trait, so with the distribution of “popu- 
lations” of traits for single persons, he considers that the “postulate” 
of normality should be accepted only after it has been empirically con- 
firmed. Actually the lists of traits commonly employed (e.g., 13, p. 
20) appear to him to be not only non-normal, but discontinuous and 
multi-modal. 

16. We therefore differ widely over the occasions on which we 
believe correlating persons to be legitimate. Burt holds that, if the 
traits are normally distributed, correlation may be used; alternative- 
ly, if the units of measurement are objective, covariance may be used. 
But, since neither condition is easily obtainable, he would prefer, 
where possible, to base the analysis of types on correlations between 
traits rather than on correlations between persons. Since Q-technique 
aims at entirely different types, Stephenson cannot accept this sug- 
gestion. 

17. We differ over the distribution of the types measured by the 
saturation coefficient for persons. Here Burt believes that the distri- 
bution of factor saturations, obtained by correlating persons and 
transformed to z-values, is approximately (though not exactly) nor- 
mal. Stephenson believes that the results indicate separate and dis- 
continuous types — as distinct as primroses from buttercups — so 
that certain persons may have zero saturations for certain type-fac- 
tors. 

18. We differ over the meaning to be attached to the factor- 
measurements for the type. We agree that the saturation coefficient 
for a person expresses the extent to which he fits this pattern. But 
Burt regards the type described by this pattern as an extreme and 
hypothetical ideal, towards which actual persons more or less approxi- 


*“Nor is any analysis of data complete until it is examined both ways, by 
r- and Q-techniques” (6, p. 360). 
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mate, but with which they never perfectly correlate. Stephenson re- 
gards such a pattern as a caricature. For him the ideal representa- 
tive of the type is a specimen personality, i.e., the modal person, not 
the extreme of a normal (or nearly normal) distribution (13, p. 198). 

19. These further divergences are evidently related to the partic- 
ular interpretations we attach to the notion of psychological “types.” 
For Burt a type is merely a pattern of weighted assessments, into 
which different individuals fit more or less closely. As such it is ex- 
pressible in terms of a single factor. All types are therefore related 
to corresponding tendencies, which may be highly complex; but such 
tendencies are descriptive only. It follows that we can distinguish, if 
we wish, as many types (or pairs of types) as there are factors dis- 
coverable in correlating traits. To this Stephenson can agree only 
with important reservations. For him, a psychological type [as de- 
fined in paragraph 7 (c) above] has no mathematical relation to the 
tendencies or abilities which are the concern of r-technique alone. 
Types which are the concern of Q-technique are related to such un- 
derlying tendencies in a statistical fashion only: between the under- 
lying tendencies and the lived personality the relation is so involved 
and highly ramified that statistical representation alone does justice 
to the facts. Believing, as he does, that only about five common fac- 
tors can be discovered when correlating traits, he cannot admit that 
the number of types is limited by the number of factors discoverable 
by r-technique. On the other hand, when correlating persons, just as 
he believes in an innumerable number of factors, so he believes in an 
innumerable number of types: types, in fact, as he defines them, can 
be as numerous as the genera and species of animals in the present 
epoch of fauna. 

20. Thus, although expressed in statistical terms and producing 
minor differences in statistical procedure, the fundamental differences 
between us are not so much statistical as methodological. In Q-tech- 
nique there is to Stephenson’s mind nothing statistically novel, and 
hence statistical criticisms of this formula or that are of little mo- 
ment. In Burt’s work on person factors there is, he admits, statis- 
tical novelty, but he regards it as psychologically unimportant. On 
the other hand, Stephenson believes that on the psychological side 
Q-technique opens an entirely new field, namely, typology as he has 
defined it above: and it is this, not the mere mathematical procedure, 
that he has throughout desired to stress. 
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STUDIES IN MATHEMATICAL THEORY OF 
HUMAN RELATIONS. II 


N. RASHEVSKY 
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In continuation of previous studies, relations are studied be- 
tween two classes of population, of which one is characterized by a 
much greater ability to organize and supervise the productive activ- 
ities of the other. Under some special and rather simple assump- 
tions, it is shown that this kind of interreaction results first in an 
increase of the ratio of the cumulative results of productive activ- 
ities for the two classes in favor of the first. With time, however, 
this ratio reaches a maximum and declines. An expression for the 
“life span” of such organized classes is obtained. 

In the second section, a study is made of possible non-uniform- 
ities of spatial distribution of the population. Some possible appli- 
cations are made to the theory of the variation of the ratio of urban 
to rural population, showing how that ratio may increase with in- 
creasing total population. 


I 


In a previous paper (1) we discussed some possible interrela- 
tions between social classes based on the different amounts of initia- 
tive and imitativeness. In the present paper we shall discuss an inter- 
reaction of classes based on a different characteristic, namely, the 
difference in different people of the ability to perform different kinds 
of work and to produce different goods. In a sense, such discussion 
should lead us into the domain of mathematical economics, and in this 
field a very considerable progress already has been made by well- 
known works (2) of mathematical economists. However, we shall 
approach these problems from a somewhat different point of view, 
which will bring out particularly the social aspect of the interrelation 
of economic classes. In order to do so, we shall start again with high- 
ly oversimplified hypothetical cases. In view of the fact that much 
more complex and realistic cases have already been treated by the 
above-mentioned scientists, it may seem rather strange to start again 
with oversimplified cases. However, as we shall see, the problem 
which interests us here is different from the usual type of problems 
treated, offering its own complications; this consideration justifies 
the apparent set-back. Our failure to make use of those well-known 
results is due not to ignorance or neglect, but to the circumstance that 
the particular problems discussed here are not yet ripe for the use 
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of those more advanced results. This will become apparent to the 
reader as he peruses section I. (Cf. also the concluding paragraph 
of section I.) 

Let us again consider for simplicity a society composed of two 
types of individuals J and J] , their corresponding numbers being N, 
and N.. Let these two types of individuals form two corresponding 
social classes, class J, and class JJ. Let us consider the following 
purely hypothetical case. 

Each individual of type J produces per unit time an amount p, of 
goods necessary for the maintenance of his life, and he consumes per 
unit of time an amount c, of these goods. Similarly every individual 
of type JJ produces and consumes correspondingly amounts p. and 
Co. at 


and (1) 


Then, in the absence of any exchange of external supply, the individ- 
uals of type J are capable of indefinite existence, those of type II are 
not. 

Now let the individuals of type J be capable of organizing their 
work, as well as the work of individuals of type JJ, in such a way 
that the production of the necessary goods is largely increased. In 
other words, let the production of the goods require a certain amount 
of organization skill, which is possessed by individuals of type I, but 
not by those of type IJ , so that while for themselves the individuals 
of type IJ cannot produce enough goods, they can do so under the di- 
rection of the individuals of type /. 

While this is a rather abstract situation, yet it reminds one of 
some real situations. Thus a large number of workers, who produce 
complicated machinery in a factory as a result of work directed by 
a group of engineers and executives, would not be able to produce the 
same results, when left to themselves. 

Under the conditions assumed above, the following relation will 
naturally establish itself between the individuals of type J and those 
of type JJ. The latter will agree to work under the direction of the 
former, so as to increase their output in goods, provided they do re- 
ceive a part of the goods thus produced for their consumption. On 
the other hand, the individuals of type J , unless they act purely altru- 
istically, a supposition too unlikely to be considered even theoretically, 
will agree to direct the work of the type IJ individuals only ft they 
receive also part of the goods thus produced. Each individual of type 
IT will give an amount w of his labor per unit time. The amount w 
may be conveniently measured in working hours per day. In compen- 
sation for that amount of work, he will get an amount éw of goods. 
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6 may be considered as the price of his labor, paid not in money, but 
in goods directly. 

If, for simplicity again, we consider the case in which all Nz in- 
dividuals of type JJ are giving their work, the total amount P of goods 
produced under the direction of individuals of type I will be propor- 
tional to N.w , thus 


P AN.w (2) 


The coefficient of proportionality A will itself depend on both N, and 
N, in general, so that 


A=A\(N,,N.2). (3) 


The nature of this functional dependence will itself be determined by 
the character of the labor involved, method of production, ete. In the 
simplest case, however, it can be seen that A(N,, N.) should, at least 
approximately, be a function of N,/N., so that 


(4) 


The reason for this assertion will be seen, if we remember that, other 
conditions being kept constant, increasing both N, and N. m times 
should approximately increase P also m times. Hence such an increase 
must leave A invariant. 

Some other properties of the function f in (4) may be found 
from general considerations. Since in the absence of individuals of 
type I no goods are produced under their direction, we must have 


f(0) =0. (5) 


Keeping N., the number of workers available, constant but increas- 
ing the ratio 
N, 

by increasing N, , will, for small values of N, , increase P. But, when 
for a constant N. , N, exceeds a certain amount, any further increase 
of N, will not result in any increase of P. A certain number of work- 
ers may require a definite number of supervisors, but an excessive 
number of supervisors does not increase the output. Hence f(7) must 
either have a maximum for a certain value of 7, or it must increase 
with 7 only up to a certain value of 7, and for any further increase 
of the latter remain practically constant. 

In order to obtain closed expressions, we may consider as a very 
rough approximation 


(6) 
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f(y) =f.(1—e) . (7) 
This of course does not preclude consideration of other forms. We 
may take also 


f(y) — (8) 


and consider it only in the range 0 < » < a,/a,., for which the ex- 
pression (8) is positive. It has a maximum for » = }(4@,/d.), the 
maximum value being = 4(a,?/a.). 

Introducing (6) into (4) and then the latter into (2), we have 


P=f(y)Now. (9) 


Of the total amount P of goods produced, an amount 6N.w is received 
by the N, individuals of the type II, the remaining amount being re- 
tained by those of type J. Hence the net rate of increase of the total 
amount W, of goods, produced by individuals cf type J themselves 
and by those of type J/ for the class /, is: 


= +f Nato — ON 20 (10) 
and, similarly for the class IJ , 
= + . (11) 


Even though, according to equ. (1), «. < 0, dW./dt may be positive, 
since 6N.w > 0. It must be remarked however, that (1) is not neces- 
sary at all in order to have (10) and (11). Quite generally, we ob- 
tain the same type of interreaction of the two classes even when both 
€, and «, are positive. This type of cooperative interaction may in- 
crease the rates dW,/dt and dW./dt from their values N,e, and Ne, 
which they would have in the absence of such a cooperation. This pre- 
supposes such a choice of constants that 


f(m) > 6. (12) 


Inequality (12) fixes the highest “price” that the first class will be 
willing to pay for labor. The actual @ is determined by the demand of 
the individuals of type IJ for the goods produced. Just as in the stand- 
ard treatments of mathematical economics, the larger @ per unit of 
w , the more of w an individual is willing to give. Hence 


w=—u(6), (13) 


u being an increasing function of 6. For simplicity let us put 


+ 
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W = Wo (14) 
w, and b being constants. Expression (14) is suggested by the use of 
simple linear approximation for demand curves and by remembering 


that may be considered as the “price” of the goods in terms of labor. 


For 6 < b/w,, individuals of type IJ will not agree to work at all. For 
@ = oo they cannot work more than the physically possible amount wp . 
Introducing (14) into (10) gives: 

dW b 

+ (n) — 8] (15) 
dW,/dt is now a function of 6, and if the individuals of type J wish 
to make dW,/dt as large as possible, they will agree on a 6 which 
makes the right-hand side of (15) a maximum. This happens for 


_ 
a= || (16) 


and, for the maximum value of the right-hand side of (15), the latter 
becomes 


dw, 


Nits + Nalwof — 2v (y)] (17) 
Introducing (16) into (14), and then both into (11), we find: 
dW, 
ar = Noles — b + (18) 
Equ. (17) may be written thus: 
dw, 
= N.[en + Wof (n) + b— 2V (19) 


In order for any such interreaction as discussed here to be possible at 
all, both classes must gain something. This imposes certain inequal- 
ities on the constants, namely, 


en+ Wof (n) + b —2\V/bwo) >0 (20) 
and 
&—b-+ vbwF() > 0. (21) 


If (21) is to hold, and if «. < 0, then a necessary condition is , - 
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Vbwof (ny) —b>0. 
Because of (16), and since necessarily 6 > 0, this requires: 
b < Wof (22) 


But (22) is not sufficient in this case. In the interval 0 < b < wof(y), 
the expression \/ bw,f (yn) — b is positive and has a maximum for 


Wof (y) (23) 
4 
the maximum value being equal to 
Wof 
(24) 


Hence, for «. < 0, the inequality (21) requires not only inequality 
(22) but also 


> (25) 
Requirement (20) may be replaced by a more rigid one, namely 
Wof (yn) + b—2Vbw f(y) >0. (26) 


Only in this case is dW,/dt larger with the cooperative arrangement 
than without it. But for values of b in the neighborhood of b,, the 
expression b — 2\/bw,f (7) is negative. However, for b = b,, this ex- 
pression equals — # w,f(y). Hence for b = b,, the left-hand side of 
(26) equals 4w.f(n), and (26) is therefore satisfied. 

These considerations show that b would have to be in the neigh- 
borhood of b,, in order that there would be a gain from cooperation to 
individuals of class IJ for e«. < 0. A value of b too large or too small 
results in a loss to the individuals of type IJ. If, however, «. > 0, 
then all that is necessary is that b should be sufficiently small. In this 
case too large a “price” asked for labor will result in the impossibility 
of reaching any agreement or cooperation. 

If e. < 0 and (25) holds, then with b ~ b,,, dW,/dt > dW./dt. 
Equations (19) and (18) integrated give: 


W, = N.[en + wof (n) + — 2Vbwof I]t + Wo, (27) 
W,= —b + Vbwof t + We, (28) 


where W,, and W.. are initial values. If we start with equal amounts 
of W per capita, so that 
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Wau _N; 
or 
Wo nWoe (30) 


then for t = 0, W,/W. = 7, while for sufficiently large ¢ this ratio 
tends to 
W, + Wof + b— bwof (7) 


>1>7. (31) 
W, V bwof (m) : 


Thus there will be a gradual increase of the ratio of accumulated 
goods for the two classes. 

However, things will happen in this manner only when N, and 
N, remain constant. The situation will be different if we consider a 
sufficiently long period of time, during which not only the size, but 
also the composition of the classes changes. Originally we have N, 
individuals of type J forming class J , and N. > N, individuals of type 
IT forming class IJ. Let us consider again, as in the previous paper 
(1) that the progeny of parents of a given class continue to belong 
nominally to the same class, although they may be of a different type. 
Denoting by n,’ the number of individuals of type J in class J, by 7,” 
the number of individuals of type IJ in class J, by n,’ the number of 
individuals of type J in class II , and by n.” the number of individuals 
of type IJ in class IJ, let us put: 


M+ Mm" =% 
(32) 
Ny’ + nN,” = 
Denoting by # the rate of increase of the population, assumed the same 
for all types, and by a the fraction of individuals of type J born, we 
have (1): 


dn, _ dn, _ 
pr, ; ; (33) 
Ny = My, CBE Me = Nop CBF (34) 
and 
(1 — a) Bn, ; 
(35) 
dn,’ _ 
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Remembering that for = 0, 2,’ = 1,7," = = , 
we find from (35) and (34) 
nN,’ = amo, —1) +%, = (1—a)M,(e8! —1) ; 
(36) 
= My” = (1 —a) 4 ang. 


For simplicity we neglect here the effect of ‘“‘vertical mobility” from 
class to class (1, 4). 

According to the picture made here, these variations of the dif- 
ferent 1; result in a variation of 7 , for 7 is nothing else but 7,'/2, . 
The quantities «, and «, are also affected by these changes, for when 
the whole class J of n, = n,' + n,” individuals is not composed of indi- 
viduals of type J, but of individuals of both types, we should substitute 
for ¢, and «, average values, such as 


Putting 
n , 
(38) 


introducing (36) into (37) and (38), and then introducing the result 
into (19) and (18), we obtain differential equations for W, and W,, 
which, however, are too cumbersome to be treated directly. In view of 
the schematizations already introduced, an exact solution of these 
equations would hardly be worth while. However, we may derive ap- 
proximately some general properties of the solutions, without actually 
solving the equations. 

Let us again consider the ratio W,/W.. As we have seen, it orig- 
inally increases, tending to a constant value given by (31). This value, 
however, remains constant only as long as 7, «, and ¢, are constant. 
Since this is not the case, W,/W,. will change. From (34) and (36) it 
follows that n,/n. remains constant, but that n,’/n, decreases. Equa- 
tion (37) shows that e, decreases from &, = ¢, to & = @. < &, ; while 
é, increases. The quantity 7 decreases, because n,'/n. decreases. This 
will result in a decrease of the numerator of (31) and an increase of 
the denominator, hence in a decrease of W,/W.. We have specifically 
considered the situation in which originally dW,/dt > dW,/dt, in 
other words, when [cf. equ. (18) and (19)]: 


+ Wof (yn) +b > —b+ VbwF(m). (39) 
We shall investigate how inequality (39) is affected by a variation of 
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n. In order to discuss this, we must make same assumption about 
f(m). If f(y) is of the form (7), then very roughly we may approxi- 
mate it for small values of » by 


f = foay (40) 
in the range from y = 0 to 7 = 1/a, and by 
f= fo (41) 


for n > 1/a. In other words, we substitute instead of the curve (Fig. 
1) two segments of straight lines. Since a variation of y will affect 


FIGURE 1 


f(y) only for smaller values of » , we may substitute (40) into (39). 
We shall also neglect in the first approximation the changes of «, and 
e.. As a matter of fact, we shall considerably simplify our formulae, 
without greatly affecting their generality, by putting «, = « = 0. 
Making the above substitution into (39) and substituting an equality 
sign for the inequality, we obtain the following equation for determi- 
nation of * the value of » for which inequality (39) breaks down. 


Wof + 2b = V bwof .an* (42) 
Squaring and solving for 7* we find two positive roots 
b 4b 


Since equations (43) are obtained by substituting equ. (40) into 
(39), and since equ. (40) holds only for 7 < 1/a, therefore the above 
argument, as well as its subsequent consequences, holds only if the 
larger root 7° is less than 1/a , or if 4b/wofoa < 1. Due to the ap- 
proximation used, our results hold only within a restricted range of 
the constants. The investigation of the more general cases will be giv- 
en elsewhere. The meaning of the two roots becomes clear upon in- 
spection of (39). It can be easily shown that for «, = «, = 0 inequal- 
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ity (39) holds for 4 > ° and for n < 7° , while for »* < 4 < 7} the 
inequality (39) is reversed. Physically, the meaning of this situation 
can also be easily understood. A decrease of 7 means a decrease of 
proper supervision or organization of the work of individuals of type 
II, and a corresponding decrease in output. For 7° < 4 < 73, this su- 
pervision becomes too small to enable class J to retain a sufficient 
amount of goods relative to J]. Still the work of class IJ produces 
enough goods to make both dW,/dt and dW./dt positive. For 1 > 7° , 
both sides of (39) are positive. When 7 decreases still further, ap- 
proaching zero, the individuals of class JJ cannot produce enough and 
again dW,/dt > dW./dt, but both are negative. The first class loses 
less than the second. 

We thus see that, since 7 decreases with time, following a period 
of “concentration” of accumulated wealth in class J, there is a period 
of “deconcentration.” The beginning of the “decline” of class J is giv- 
en by the equation 


(44) 


in which we have to substitute for 7; the expression (43), for 7 the 
expression (38), and then introduce for n,’ and n, the expressions 
(34) and (36). This gives: 


ON, —1) + _ 4b 


Solved with respect to ¢: 
1 (1 ) No W of o@ 
In order that t should be real we must have 
> ANo:Wof oA (47) 


which imposes an upper limit on 7;/% 2. The meaning of this be- 
comes clear when we consider that with the assumption made here 
about f(7), an increase of %;/N%o2 above a certain limit leaves f(n) 
and therefore dW,/dt and dW./dt practically unchanged. On the other 
hand, 7,’ tends, as seen from (36) and (34), to an, . Hence y = n,'/n, 
tends to the value an,/n. = any;/N%.. If originally there has been a 
very large “reserve” of %;/%»2 , then this final value of 4 may still be 
sufficient, under these conditions, to keep dW,/dt > dW,/dt. In real- 
ity such a case is very unlikely to occur. It would actually mean a very 
large “initial reserve” of executive and supervisory personnel. In any 
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case, the ratio W.,/W. would decrease due to the decrease of ¢, and 
increase of «,, which we neglected in our approximation. 

Moreover, we must consider that gradually the number 1’, of in- 
dividuals of type J in class JI is increasing. When the ratio 7’ = n2'/ne 
becomes sufficiently large, a new class of I’ of “supervisors” will be 
formed within class II , and I’ will begin to compete with class I . De- 
pending on the choice of the constants, this may happen even before 
class J “declines” appreciably spontaneously. 

The rise of class J’ and its competition with J will begin when 7 
satisfies the equation 


+ Wof (yn) + b— 2\/bwof 
(48) 


= + wf (1') +b 


which requires that 7 = 7’ and in which we have to put [because of 
(34) and (36) ] 


(€8* —1) + 


_ a(e®*—1) 
This gives: 
(1 — a) M1 + (50) 


a — No1) 


B 


We may consider a more complicated case, in which, due to the differ- 
ence of the amounts of wealth already accumulated, the competitor in 
order to succeed must start with more stringent conditions than (48). 
For instance, we may require that the right-hand side of (48) ex- 
ceeds the left-hand side by an amount which is a function of the dif- 
ference of accumulated W, and W.. This will give for t, instead of 
(50), an expression which will involve such constants as ¢,, &, fo, @ 
and b. 

The interesting result of this study is that it shows us a simple 
picture of the “birth” of a “wealthy” class, with an ensuing initial 
“concentration” of “wealth,” followed by a “decline” and finally suc- 
ceeded by a class composed of different individuals. While a differen- 
tiation according to accumulated wealth remains, yet the class distinc- 
tion shifts with time from one group of individuals to another. When 
developed much farther, such a study may perhaps be connected with 
P. Sorokin’s interesting study on the “life-span” of different social 
organizations. Comparisons of calculated life span (e.g., equ. 46) 
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with the observed ones may give us the values of some of the con- 
stants involved. 

The most interesting further developments, however, would be in 
connection with our previous study (1) of the life-span and interac- 
tions of classes, which exercise a control due to their initiative, re- 
gardless of the amount of wealth. The existence of a “wealthy” class 
is, after all, possible only when the majority of the individuals in 
class IJ regulate their behaviour according to the principles involving 
the sanctity of private property. Otherwise, they may take away the 
accumulated wealth of class J by sheer force, due to their numerical 
prevalence. Thus, the existence of such a class J presupposes either 
an influence of the individuals of this class on those of type JI, or the 
existence of another “influencing” group, which determines the be- 
haviour of individuals of type IJ. In the latter case, any changes 
of the interrelations between the influencing group and the rest of the 
population will have strong repercussions on the relation of class I to 
class JJ and on W,/W.. Those kinds of relations will form the object 
of future studies. 

The reader may now better understand the reasons why we did 
not avail ourselves of the much more developed mathematical eco- 
nomic theories. We could have introduced much less schematized pic- 
tures, discussing directly questions of cost of production, overhead, 
marginal profits, fluctuations of demand, etc. (2, 3). But even the 
very schematic picture used here leads us to rather complicated equa- 
tions. At this stage of the development, the introduction of the above- 
mentioned complications would make the task hopelessly difficult. In 
the future, however, by successively more complex steps we should 
attempt to take all these very essential details into account. 


II 


We have considered in a previous paper various types of social 
interaction in which a population of individuals has been divided into 
two or more classes. However, hitherto we have not paid any atten- 
tion to the spatial distribution of the individuals belonging to differ- 
ent classes. In this section we shall illustrate by a few simple ex- 
amples how possible effects of different spatial distribution could be 
studied mathematically (6). Again we select intentionally oversim- 
plified examples, which in their simple form may have no counter- 
part in reality. We are here concerned not so much with deriving 
equations for actual cases as with illustrating the methodological 
principles. 

We shall discuss here a case which, while in its present form 
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purely theoretical and quite abstract, may in its eventual develop- 
ments throw some light on the relation between rural and urban popu- 
lation. 

Suppose that out of N individuals a number N, draw their means 
of existence chiefly from land cultivation. Let them be distributed 
on the average uniformly through an area suitable to produce the 
necessary amount of land products. On the other hand, let the N, re- 
maining individuals gather together in cities and produce goods that 
do not require directly any land. We have 


N=N.+N,. (51) 


Let the amount of goods produced by a person on, the land be p, per 
unit time, while the corresponding amount produced by a person in 
the city be p,. In general p, and p, are not constant, but both may be 


functions of N, and N,,, thus 
Pu = fu(Nu, Nr) ; pr =f-(Nu, Nr). (52) 


Thus, for instance, for a land very scarcely populated and not used 
to the full extent, p, may be approximately constant and determined 
by a constant fertility of the soil and by the average constant capac- 
ity of a man to produce work. However, as N, increases, the total 
amount N,p, of goods produced must necessarily tend to a constant 
value, determined by the maximum possible fertility of the soil. Hence 
», must decrease with increasing N,. The law of decrease will of 
course be rather complex and depend on many factors. As an illus- 
tration, we may consider just a simple form 


a, 
where @, and a» are constants. In general, however, the value of p, 
may depend also on N,,. 

Now we may consider different cases of behaviour of the individ- 
uals. A simple case is when every individual tries to produce as much 
as he can. If for a given N, and N, = N —N,, py > p,, then indi- 
viduals from the country will migrate to the city, until, due to the 
change of N, and N,, produced by this migration, p, becomes equal 
to p,. If pu < p,, amigration from city to the country will take place. 
Since, because of (51), N, = N—N,,, p, and p, in (52) are functions 
of N and N,,. Hence the requirement 


Du = Dr (54) 


gives us an equation for the determination of N, for a given total N . 
To illustrate, let us assume that in the city p, = Const., while 
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p, is given by (53). Since the maximum value of p, is equal to @,/a,, 
and p, decreases monotonically with increasing N,, in order for equa- 
tion (54) to be possible at all, we must have 


a, 
“us 55 
(55) 
or 
(56) 
Put 
a, 
57 
(57) 


Equations (54) and (53) give: 


(58) 
or because of (51): 
ay 
(59) 
From (59) and (57) we have: 
(60) 


Thus, the ratio of urban population to the total population increases 
with increasing total population N , tending to 1 for N = o. 

We may consider a different mode of behavior of the individuals. 
For instance, as a purely theoretical example, we may consider the 
case where every individual] tries to increase the total amount 


(61) 


of goods produced, and according to this aim chooses either the urban 
or the rural occupation. In this case we have to look for such a value 
of N, which makes G a maximum. This is given by the equation 


(62) 


into which we substitute for p, and p, the expressions (52), after ex- 
pressing N, in terms of N and N, by means of (51). 

Again using for p, the expression (53) and putting p, = Const., 
equation (62) becomes: 
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a,/a, 
equa- 


(55) 


(56) 


(57) 


(58) 
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an, (Nee + N—N,+ As 


(63) 


which gives, after elementary operations: 


Nv=N+aq— (64) 
From (64) we have 
Nu _ 1 
Woity 
Because of (56) the expression in parentheses is negative. For 
(66) 


NV Du 


N./N is positive and increases with N , becoming equal to 1 for N 
= o. The reversal of the inequality sign in (66) gives N./N < 0, 
which is physically impossible. In this case, N, actually will be equal 
to zero, which means that all population in that case is rural. 

We may consider a somewhat more complicated situation. Let 
each urban individual produce p, and consume c, per unit time, and 
each rural individual produce p, and consume c, per unit time. Let 


the rural individuals supply the urban with an amount N,g, of rural 


goods and receive for each unit of those goods @ units of urban prod- 
ucts. As in the preceding section, we have a demand function con- 


necting g, with 6, and we again put 


b 
(67) 


Putting 


and using for p, the expression (53) and again considering the case 
&, > 0 with p, = Const., we have: 


a, 
é, = Const. >0 ; Noa, — Cr; 
(69) 
—c ; 


We have now for the rate of change of the total accumulated goods 
for the urban and rural population: 
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= LVyEy + ON ’ 
(70) 
dt 
Because of (51) and (67), (70) gives 
dW, b 
at = wéu + (N — Nu) (90 — 5) (1 — 8) 
(71) 
dW, _ b 
dt = — =) (1 — 0) 
The per capita rates v, and v, are: 
N—N, b 
(90 — 5) (1— 6) ; 
(72) 


Consider again the case where every individual chooses his occupation 
in such a way as to make his per capita rate of accumulation as large 
as possible. For a fixed 6, we then have a migration either to or from 
the city until 


Vy, = Vr. (73) 


Because of (51) and (69), equation (73) gives a relation between 
N, N, and 6, (or N,N, and 6). For a fixed 6 and N, it will give us 
the values of N, and N,. But, because of (72), this also fixes the val- 
ues of v, and v, in terms of 6 and N. If all individuals try to make 
their v, and v, = v, as large as possible, they will agree on such a @ 
as to make v, and v, a maximum. The details of this problem in this 
form present some interesting features and will be studied separately. 

As emphasized above, the assumptions at the basis of these con- 
siderations are too simple and crude to lead to any results that may 
have applications in real cases. The general method, however, can 
be applied to cases of any complexity as it is hoped to be done in the 
future. The simple requirements (54) and (73) will have to be re- 
placed by more complex ones, which will take into account the vari- 
ability of p. , cu , p, and c, from individual to individual. This will re- 
sult in the situation that, while for some individuals, for a given 
N./N , a migration to city will be advantageous, for others an oppo- 
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site migration will be indicated. This will complicate the problem con- 
siderably, but also will bring it closer to actual cases. A combination 
of the considerations of this paper with those of the previous one (1) 
looks particularly interesting and promising. 


The author is indebted to Mr. H. D. Landahl for a discussion of 


this paper and for the checking of calculations. 
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